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ABSTRACT 

We address the correlations of black hole (BH) mass with four different host- 
galaxy properties from 11 existing data sets. For the purpose of guiding theo- 
retical understanding, we first try to quantify the tightness of the intrinsic cor- 
relations. We assume that all of the relations are power laws and perform linear 
regressions that are symmetric in the two variables on the logarithms of the data 
points. Given the estimated measurement errors, we evaluate the probability 
distribution of the residual variance in excess of that expected from the measure- 
ment errors. Our central result is that the current data sets do not allow definite 
conclusions regarding the quality of the true correlations because the obtained 
probability distributions for the residual variance overlap for most quantities. 
Velocity dispersion as collected by Merritt and Ferrarese (ctmf) and galaxy light 
concentration as measured by Graham and coworkers (CR e ) are consistent with 
zero residual variance. Taken at face value, this means that these two correlations 
are better than the others, but this conclusion is highly sensitive to the assumed 
measurement errors and would be undone if the present estimated errors were 
too large. We then consider which of the relations offer the best inferences of 
BH mass when there is no direct measurement available. As with the residual 
variances, we find that the probability distribution of expected uncertainty in 
inferred BH masses overlaps significantly for most of the relations. Photomet- 
ric methods would then be preferred because the data are easier to obtain, as 
long as bulge-disk decomposition or detailed modeling of the photometric profile 
(as studied by Graham and coworkers do not present problems. Determining 
which correlation offers the best inferences requires reducing the uncertainty in 
the expected error in the inferred BH masses (the "error on the error"). This 
uncertainty is currently limited by uncertainty in the residual variance for all of 
the relations. The only quantities for which BH mass inferences are limited by 
measurement error are o"mf and Cr 6 - Therefore, if these relations are truly better 
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than the others, then new, improved measurements should allow improved infer- 
ences of BH masses. If they do not, the conclusion must be that the present low 
residual variances for these two relations result from overestimated error bars. 

Subject headings: black hole physics — galaxies: bulges — galaxies: fundamental 
parameters — galaxies: nuclei — galaxies: kinematics and dynamics 

1. Introduction 

There is evidence for the presence of massive BHs in the centers of early-type galaxies 
(e.g., Kormendy & Richstone 1995). Spectroscopic and photometric data of high spatial 
resolution from the Hubble Space Teles cope have made it possible to derive relatively accurate 
BH masses for a number of nearby galaxies (Kormendy & Gebhardt 2001). This has led to 
empirical derivations of relationships between the BH mass and several different properties of 
its host galaxy, such as bulge mass (Magorrian et al. 1998; Haring & Rix 2004), stellar velocity 
dispersion (Gebhardt et al. 2000; Ferrarese & Merritt 2000), bulge luminosity (Kormendy 
1993; McLure & Dunlop 2002; Marconi & Hunt 2003; Bettoni et al. 2003), and galaxy light 
concentration (Graham et al. 2001). 

These correlations pose a theoretical challenge because the mass accretion onto BHs 
takes place on extremely small spatial scales compared to the scales corresponding to these 
global properties of galaxies. For the purpose of constructing a theoretical model explaining 
these correlations, it is essential to establish which of these correlations has the smallest 
intrinsic residual variance, and therefore has the best chance of being causally linked to the 
BH mass. This is a difficult problem because the global properties are correlated with each 
other, as well as with the central BH mass. 

We analyze the correlations between BH mass and several global galactic properties as 
described in detail in §3. An interesting quantity for comparing tightness of the correla- 
tions is the "residual" variance, which is the excess variance in the correlation that is not 
explained by the observational errors. The residual variance reflects either the true scatter 
due to hidden variables other than the two variables whose correlation is analyzed (what one 
might call "intrinsic" variance) or spurious scatter in the correlation due to underestimates 
of observational or systematic errors. We posit that the most physically meaningful corre- 
lation is that with the smallest residual variance. Identifying that correlation is termed the 
"Theorist's Question." In addition to inferring the most likely value of the residual variance 
given the data, we also consider the uncertainty in our inferences about this quantity. 

A second question is which correlation allows the most accurate inferences of BH mass 
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when a direct measurement of it is unavailable. We term this the "Observer's Question," 
which is distinct from the Theorist's because the quantity serving as a predictor of BH mass 
has observational errors. For example, the correlation with a given quantity could be the 
theorist's favorite, with zero residual variance, while being useless for inferring BH masses 
because of the large errors involved in measuring it. Observers would also like the quantity 
under consideration to be easy to measure in terms of telescope time. Such considerations 
do not enter into the statistical part of the problem, but it may be desirable lose a small 
amount of accuracy to save a large amount of observational effort. 

Section 2 describes our method of analysis, §3 discusses the data sets upon which this 
analysis is based, and §4 contains a discussion of our results. Finally, §5 presents a review 
of our conclusions. 



2. Method 

The data in each case consist of N points {xi,yi}, where yi is the base-10 logarithm of 
the measured BH mass and Xi is the base-10 logarithm of the measured galaxy quantity (in 
the case of velocity dispersion or mass) or the quantity itself (in the case of magnitude or 
light concentration). Also available are the associated 1 a measurement error bars on each 
quantity, a yi and a xi . We assume Gaussian errors. 

We assume that all of the relations between the logarithms are linear (power laws). If 
one relation has a functional form more complex than a power law, our model will assign it 
an unfairly high intrinsic variance. A model more complex than a power law may decrease 
the intrinsic variance of a given correlation, but it will also involve a larger number of model 
parameters. 

The objects in each data set are nearby galaxies with central BH masses derived from 
stellar kinematics, gas dynamics, or masers. Some data sets include galaxies of all Hubble 
types, while others are restricted to early types. See §3 for details on the individual samples. 

2.1. Theorist's Question 

We first seek to compare estimates of the residual variance (e 2 ) resulting from the 
correlation of several different galactic quantities with BH mass. We wish to find the quantity 
that yields the smallest residual variance, and therefore the tightest correlation. The hope 
is that the residual variance is the "intrinsic" variance. 
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As pointed out in Tremaine et al. (2002) and elsewhere, in the absence of information 
about the direction of the causal link between BH mass and other galactic properties, there is 
no natural division of the variables into "dependent" and "independent" variables. Therefore, 
x and y are treated as symmetrically as possible. 

In extensive Monte Carlo simulations (G. S. Novak et al. 2006, in preparation) to 
investigate the performance of many fitting methods with small numbers of data points, 
errors in both coordinates, and the presence of residual variance, we found that the algorithm 
defined by Tremaine et al. (2002) based on FITEXY (Press et al. 1992) estimates the slope 
with the least bias and variance. Our results agree with those of Tremaine et al. (2002), 
that the competing estimator defined by Akritas & Bershady (1996) gives reliable estimates 
of the slope only if the spread of rc-values is large compared to their errors and if the x- and 
^/-errors of all points are comparable. If the spread of the x-values is too small compared 
to the x-errors, then the Akritas & Bershady (1996) estimator develops a bias toward large 
slopes. If some points have error bars that are much larger than the other points, then the 
Akritas & Bershady (1996) estimator becomes inefficient. A statistical estimator is inefficient 
if there is another estimator with smaller variance (Cowan 1998). 

The same Monte Carlo simulations also confirmed that, even using the extension of 
the FITEXY estimator defined by Tremaine et al. (2002), it is unfortunately impossible 
to avoid specifying about whether the residual variance is in the x-coordinate or in the y- 
coordinate. Assuming that the residual variance is in the y-coordinate when in truth it is in 
the x-coordinate results in systematically low slopes, and vice versa. 

The Monte Carlo study included unweighted least squares fitting, weighted least squares 
fitting, weighted orthogonal regression (as defined by the FITEXY routine), the modified 
FITEXY routine defined by Tremaine et al. (2002), the estimator defined by Akritas & 
Bershady (1996), a Bayesian estimator defined by Gull (1989), and a Bayesian estimator we 
defined using an unfinished manuscript by Jaynes (1991, unpublished). 1 If a given estimator 
treats x and y asymmetrically, we considered the same estimator with x and y interchanged. 
Also included were the lines defined by the arithmetic, geometric, and harmonic means of 
these two slopes, as well as the bisector line. The modified FITEXY estimator gave the 
most efficient and unbiased estimates of the slope when the fitting method incorporated the 
residual variance into the correct coordinate. When the fitting method includes residual 
variance in the wrong coordinate, the slope found by all estimators is biased as noted above 
(G. S. Novak et al. 2006, in preparation). 

Therefore, we use the FITEXY estimator defined by Press et al. (1992) as modified by 
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Tremaine et al. (2002) to include residual variance. That is, for a given value of e we find 
the values of a and [3 that minimize 



where a and (3 are the offset and slope of the correlation, respectively, xq is a value chosen 
near the mean of the distribution of x-values in order to reduce covariance between a and 
j3, and c = 1 if the residual variance is assumed to be in the ^/-coordinate while c = if it 
is assumed to be in the x-coordinate. Note that e will have different units depending on the 
value of c. If the reduced y 2 of the fit is not equal to 1, we adjust e and refit until it is. 
Finally, to estimate the upper and lower limits on e, we adjust e and refit until the reduced 
X 2 is equal to 1 ± (2/iV) 1 / 2 . This gives an estimate of the la error bar on e. 

The fact that one must specify the coordinate containing the residual variance com- 
plicates comparisons to theoretical models. If one develops a theoretical model in which 
the galaxy velocity dispersion regulates BH growth, such as Adams et al. (2001), then the 
residual variance would effectively lie in the BH mass. On the other hand, in a theoretical 
model in which the BH profoundly affects the structure of its host galaxy through quasar 
activity, such as Silk & Rees (1998), the residual variance would lie in the galaxy property. 
Therefore, theoretical models should attempt to match different slopes depending on the 
structure of the model. 

However, blind comparisons to the different slopes can run into problems because of 
observational selection effects. Galaxies are selected for BH mass studies based on their 
properties, not their as-yet-unknown BH mass, so regarding the BH mass as the independent 
variable for a linear fit will result in selection criteria that are difficult to quantify. For 
example, if galaxies are selected based on a hard limit in the galaxy property and one then 
regards the BH mass as the independent variable, the slope will be systematically low. 

The best procedure would be for theoretical models to provide the full probability 
distribution linking BH mass and other galactic properties. It would then be possible to 
make any desired comparison to observational data while properly accounting for selection 
effects. Unfortunately, theoretical models for BH formation are generally far from this level 
of precision. 

The answer to the Theorist's Question is extremely sensitive to inaccurate estimates of 
measurement errors. If measurement errors are underestimated, then the fit will find too 
much residual variance. Similarly, if the measurement errors are overestimated, the fit will 
find too little residual variance. Finally, if an observer overestimates measurement errors so 
severely that the correlation becomes "too good," in the sense that the observed scatter is 
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less than that expected from the error bars, the result will be a e distribution sharply peaked 
around zero, and the reduced x 2 of the linear fit will be less than 1 even when e is set to 
zero. 



2.2. Observer's Question 

Next, we wish to quantify the predictive power of each correlation. If we had a proba- 
bility distribution for the model parameters, the machinery of Bayesian statistics could be 
used to construct the predictive distribution for BH masses given a predictor variable. This 
is the probability distribution for an about-to-be- measured y- value given a just-measured re- 
value and model parameters that are constrained by all of the previously collected data. The 
important thing about the predictive distribution in Bayesian statistics is that it includes 
both uncertainty in the model parameters and expected observational uncertainty 2 . 

In the Bayesian formalism, usually one makes a x 2 estimator into a probability distribu- 
tion for the model parameters via p(M\D) oc p(M) exp(— % 2 (M, D)/2), where x 2 is given by 
equation 1, M is a set of model parameters, and D is a set of data. Unfortunately, allowing 
observational errors in both x and y makes the distribution for the model parameters depend 
nontrivially on the prior distribution of the unobservable parameter (BH mass in our case) 
in a manner that does not become less important as the number of data points grows (as is 
usually the case; Gull (1989)). Therefore, the standard Bayesian predictive distribution is 
not available, but we can construct a "poor man's" estimate of the desired quantity — the un- 
certainty in inferences about y given both observational uncertainty in x and uncertainty in 
the model parameters — by combining variances in the standard way. The expected variance 
is 

a 2 y = (x- :r ) V 2 + P 2 a 2 x + a 2 a + [c + (1 - c)/3 2 ]e 2 , (2) 

where a y is the expected uncertainty in a "new" y-value, a x is the expected observational 
uncertainty in a new x-value, and erg and o a are the uncertainties in the slope and offset 
given the data collected thus far. 

It is important to realize that the answer to the Observer's Question is far less sensitive 
to inaccurate estimates of measurement errors than the answer to the Theorist's Question. 
Either over- or underestimating the error bars will mislead the theorist, whose main interest 
is residual variance, but the total uncertainty resulting from the combination of residual 
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variance and measurement error will remain virtually unchanged. Therefore, the quality of 
the observer's inferences about BH masses will be unchanged. 

Distance uncertainty has a different impact on the Theorist's and Observer's questions. 
The theorist would prefer a correlation where both variables scale the same way with distance, 
such as BH mass and bulge mass. This will preserve a tight correlation even when the 
distances to the individual objects are uncertain. The observer would prefer a quantity that 
does not scale with distance, such as velocity dispersion on galaxy light concentration, so 
that BH mass inferences will not depend sensitively on the unknown distance to the object. 
Thus, the theorist will have an easy time calibrating a tight relation, but a hard time using 
that relation to infer BH masses. The observer will have a hard time calibrating a relation 
(because distances must be known accurately), but an easier time inferring BH masses after 
the calibration is finished. 



3. Data Sets 

We analyze several observable galaxy properties that correlate with BH mass. Although 
reverberation mapping studies in active galactic nuclei are rapidly converging to the existing 
BH correlations for inactive local galaxies in terms of residual variance (Onken et al. 2004), 
in order to maintain homogeneity they are excluded from this study. We use the BH masses 
and distances quoted in each paper with the exception of NGC 821, for which we revise the 
BH mass estimate as indicated by Richstone et al. (2004), scaling to a different distance if 
necessary. 

Our core sample is that of Gebhardt et al. (2003) because this sample consists of 10 
E and 2 SO galaxies with BH masses and M/L values all inferred from similar dynamical 
models of the galactic potential. The sample is therefore homogeneous. The derived bulge 
masses do not suffer from large perturbations due to the presence of small disks. 

The velocity dispersions are denoted ctg- Following Tremaine et al. (2002) we assume 
5% errors on the velocity dispersions. 

Absolute 5-band luminosities for the Gebhardt et al. (2003) galaxies, denoted M B) are 
taken from the same paper. The dominant source of error is the random error on the distance 
from Tonry et al. (2001). 

The bulge masses for the Gebhardt et al. (2003) galaxies are denoted mtmige- Absolute 
5-band luminosities and V"-band M/L values are from the same paper, distances are from 
Tonry et al. (2001), and B — V colors are from Faber et al. (1997) or de Vaucouleurs et al. 
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(1995) as indicated in Table 1. The M/L value is derived from the same dynamical models 
used to estimate BH mass. The uncertainty in the bulge mass comes from the published 
uncertainty in M/L combined with the random errors in the distance from Tonry et al. 
(2001) propagated through the absolute magnitude using standard error propagation. 

A simpler estimate of the bulge mass comes from the homology assumption in which 
mpp = kR e cr 2 /G, where R e is the half-light radius, a is the velocity dispersion, G is the 
gravitational constant, and A; is a structure constant (assumed to be the same for all galaxies 
in the sample) that depends on the exact form of the mass and velocity distributions. To 
compute mpp, velocity dispersions are taken from Gebhardt et al. (2003), distances from 
Tonry et al. (2001), and R e values from Faber et al. (1997), de Vaucouleurs et al. (1995), or 
Faber et al. (1989) as indicated in Table 1. The dominant error is uncertainty in the distance 
propagated through R e using standard error propagation. 

This core sample is enlarged by considering other data sets. Values and error bars are 
in general adopted from the published values. Tremaine et al. (2002) include the galaxies 
from Gebhardt et al. (2003) plus 19 additional galaxies over a wide range of Hubble types. 
Velocity dispersions for these galaxies are denoted a T . 

Merritt & Ferrarese (2001) collected BH mass estimates for 27 galaxies over a wide range 
of Hubble types. They took velocity dispersions from the literature and corrected them to 
dispersions within R e /8 (denoted 0mf) using an empirical formula. 

Graham et al. (2001) collected BH mass estimates for 21 galaxies and used galaxy light 
concentration, denoted Cn e , as the BH mass predictor variable. They define Cn e to be twice 
the fraction of the galaxy's light enclosed within 1/3 of the half-light radius. They computed 
Cpte by fitting a Sersic profile to galaxy images and then using an analytic formula to convert 
the Sersic index n to Cr c . They estimate the error in Cr 6 based on previous experience with 
the fitting procedure. 

Marconi & Hunt (2003) collected 37 galaxies spanning a wide range in Hubble types and 
advocated the use of near-infrared luminosity to predict BH mass. They retrieved J-band 
apparent magnitudes from the 2MASS survey and used distances from Tonry et al. (2001) or 
Hubble velocities corrected for Virgo-centric infall to convert them to absolute magnitudes. 
They do not quote uncertainties in Mj; we estimate them from the distance errors quoted 
by Tonry et al. (2001), or, where not available, we assume 0.5 mag of uncertainty in the 
distance modulus. They exclude several galaxies, including those for which the BH sphere 
of influence is less than one resolution element. 

Haring & Rix (2004) used spherical, isotropic Jeans modeling to estimate the bulge 
masses of 10 galaxies and drew bulge mass estimates from the literature for an additional 
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20 galaxies, including 12 from Magorrian et al. (1998). They used BH mass estimates 
mostly from Tremaine et al. (2002) to argue that the BH-mass-to-bulge-mass correlation 
has similar residual variance to the other correlations under consideration here. Their bulge 
mass estimates are denoted mH aer - 

McLure & Dunlop (2002) drew the E-type galaxies from the sample published in Ko- 
rmendy & Gebhardt (2001) (discarding two because the uncertainties in the BH masses 
were larger than for the rest of the sample) and used 5-band luminosities from Faber et al. 
(1997), y-band luminosities from Merritt & Ferrarese (2001), and average color corrections 
from Fukugita et al. (1995) to get i?-band luminosities for 18 galaxies. The dominant error 
comes from the uncertainty in the distance from Tonry et al. (2001). Where distances based 
on surface brightness fluctuations are not available, we assume 0.5 mag uncertainty in the 
distance modulus. They argued that the problem with the BH-mass-to-bulge-luminosity 
relation was simply bulge-disk decomposition, and they claimed that removing spiral and 
SO galaxies resulted in a very good correlation. They do not provide a data table, but they 
do describe how they selected their sample. We attempt to reconstruct it. Their i?-band 
absolute magnitudes are denoted M# ; m, where the M stands for McLure. 

This notation is necessary because Bettoni et al. (2003) also used i?-band luminosities 
in a very similar sample. They also drew only E-type galaxies from Kormendy & Gebhardt 
(2001), resulting in a sample of 20 galaxies. They drew 5-band luminosities from Faber 
et al. (1997) for all but three galaxies, for which they obtained 5-band luminosities from 
de Vaucouleurs et al. (1995). To get i?-band luminosities they used colors from Prugniel 
& Maubon (2000) or de Vaucouleurs et al. (1995). Their i?-band magnitudes are denoted 
M R:B . 

To simplify the analysis we make all error bars symmetric about the preferred value by 
averaging the size of the upper and lower 1-a error bars so that becomes x ± (h + /)/2. 
This simplification should not greatly affect the result, since most of the confidence regions 
are reasonably symmetric in log space. 

4. Results 

4.1. Theorist's Question 

Figure 1 shows the raw data for the 11 global galaxy quantities described in §3. We 
apply the method described in §2.1 to all of these variables in order to infer the residual 
variance associated with each linear fit. 
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Above, we noted that one's choice about how to include residual variance — whether in 
x or in y — affects estimates of the slope of a given correlation. For the galaxies listed in 
Tremaine et al. (2002), the slope is 4.59 ± 0.34 or 4.10 ± 0.30 depending on whether the 
residual variance is assigned to x or y, respectively. For Merritt & Ferrarese (2001) galaxies, 
the slope is 4.54 ±0.38 or 4.42 ±0.37. One might choose any one of these numbers as a point 
of comparison for a theoretical model, depending on the content of the model. 

The most straightforward comparison among the different correlations is obtained under 
the assumption that the residual variance is always in the BH mass, so that it has the same 
units in all cases. Figure 2 shows the probability distribution of possible e values given the 
data sets published for each correlation under consideration. We estimate the 1 a error on 
e as described in §2.1 and then plot a Gaussian distribution of the absolute value of e in 
order to emphasize the extent to which the allowed ranges overlap. These distributions of e 
are most relevant for answering the Theorist's Question, which asks what quantity is most 
closely associated with BH mass for the purpose of constructing theoretical models. The 
best intrinsically correlated quantity will have the distribution of e closest to zero. 

The differences between the various correlations are not highly significant, as most dis- 
tributions overlap substantially. This indicates that the available data do not constrain the 
residual variance well enough to make strong statements about which correlation is intrinsi- 
cally tighter than the others. The galaxy light concentration CR e and the velocity dispersion 
<7mf as measured by Merritt & Ferrarese (2001) show residual variance distributions that 
are peaked at lower values than the others, and each is consistent with zero. At face value, 
this means that these two correlations are better than the others, provided that their error 
estimates are accurate. 

There seems to be a slight systematic trend among the types of quantities — galaxy light 
concentration has the smallest residual variance, followed by the three velocity dispersions, 
then the four luminosities, and finally the three masses. However, it is difficult to make much 
of this, since the individual distributions overlap significantly, and many of the different 
samples are correlated because they have many galaxies in common. 

If one omits the galaxies for which the BH sphere of influence is unresolved according to 
Marconi & Hunt (2003), all of the curves become less sharply peaked, because the samples 
have gotten smaller. This effect is largest for Cr c , where the 1 a upper limit goes from 0.15 
to 0.23. This is a somewhat larger change than one would expect just from the change in 
sample size, indicating that the unresolved galaxies happen to lie very close to the fitted line 
for this relation. 

The question of which correlation is intrinsically tightest will remain open until more 
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accurate data become available or the number of objects increases dramatically. There is 
some indication that reverberation mapping may become useful in constraining the residual 
variance of the BH mass correlations by greatly increasing the number of objects with BH 
mass estimates of sufficient precision (Onken et al. 2004). 

If one is willing to view the many experiments undertaken in the literature involving 
various subsamples, various galaxy variables, and various corrections as a way of learning 
about the distribution of e via an empirical bootstrap-like error estimate, the conclusion 
agrees with our result nicely: quoted numbers for e range from ~0.2 to ~0.5 dex under a 
wide variety of assumptions. 

In spite of the fact that the Theorist's Question is not answerable at this time, we 
include a compilation of our fits to all of the data sets assuming residual variance in both 
the x- and y-coordinates in Tables 2 and 3, for the purpose of comparing the slopes of the 
many theoretical models to the observations. 

Table 4 provides Pearson and Spearman correlation coefficients along with confidence 
regions. The Pearson and Spearman values do not differ markedly except for the smallest 
data sets, which contain 12 galaxies. For these data sets, the confidence regions reveal that 
the coefficients are essentially unconstrained. 

4.2. Observer's Question 

Next we consider which correlation best predicts the BH masses of galaxies for which 
there is no direct measurement. Such correlations are in great demand to answer a variety 
of astrophysical questions (e.g., Salucci et al. 1999; Aller & Richstone 2002; Yu & Tremaine 
2002; Barth et al. 2003; Marconi et al. 2004). Here the relevant quantity is the predictive 
power of each correlation, which we explore with our poor man's predictive distribution 
defined in §2.2. Inherent in this approach is the assumption that future data sets used to 
predict BH masses will have the same predictor measurement errors as those discussed here. 
The method can handle other assumptions, but we do not pursue them here. 

For each predictor variable, we consider a hypothetical new measurement with observa- 
tional error equal to the mean of the absolute value of the observational errors for existing 
measurements of that predictor variable. All of the residual variance contributes to un- 
certainty in the BH mass, so we do the fits assuming that the residual variance is in the 
y-coordinate. Hypothetical x-values are chosen to lie near the center of the range of ob- 
served values. For velocity dispersions, x = 190 km s _1 , for bulge masses x = 6 x 10 10 M Q , 
for galaxy light concentration x = 0.435, for 5-band magnitude x = —20.4, for J-band 
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Fig. 2. — Probability distributions for residual variance for 11 BH mass predictor variables. 
Residual variance is the variance that is left over after observational errors are accounted 
for; a smaller residual variance means the correlation is intrinsically tighter. For most of 
the correlations, the available data do not constrain the residual variance well enough to 
make strong statements about which correlation is preferred over the others. The only 3 a 
statement that can be made is that Cr c is better than mpp, ^Haer, and Mj, and this depends 
critically on the assumption that errors on all predictor variables are accurately estimated. 
The galaxy light concentration (Cpt e ) and velocity dispersion as measured by Merritt & 
Ferrarese (2001) have residual variance distributions that are peaked at lower values than 
the others. Both relations are consistent with zero residual variance. Figure 3 shows that 
they are also the only two relations with predictive uncertainties in Mbh dominated by 
measurement error rather than residual variance. If these relations truly are better than the 
others, then more accurate measurements of the predictor variable will allow very good BH 
mass inferences. 
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FrcdicLivc uncertainty in M RH (dcx) 

Fig. 3. — Expected uncertainty in inferred BH mass for a new measurement of the predictor 
variable which results in a BH mass near 10 8 M Q (near the center of the observed range). 
Near the edges of the range, the expected uncertainty goes up somewhat because the slope 
of each relation in not known precisely. The peak of each distribution comes from equation 
2, and the width is determined by 100 bootstrap Monte Carlo samples on each data set as 
described in §4.2. If a curve is peaked in the same place as in Figure 2, then the predic- 
tive uncertainty is dominated by residual variance; if a curve moves significantly, then the 
predictive uncertainty is dominated by measurement error. Galaxy light concentration Cr c 
and velocity dispersion <tmf are dominated by measurement error, while the others are dom- 
inated by residual variance. The width of each curve (the error on the error) is dominated 
by uncertainty in the residual variance in all cases. The only 3 a statement that can be 
made is that or, o"mf, and Mr ; b are better than mpp. 
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magnitude x = —23.4, and for i?-band magnitude x = —22.8. 

Given the hypothetical new measurement, we seek our best estimate of the error in the 
predicted BH mass and the uncertainty in this quantity. The error itself is obtained from 
our poor man's predictive probability distribution. To obtain the "error on the error" we 
produce 100 bootstrap Monte Carlo data sets from each data set by selecting data points 
at random with replacement. For each synthetic data set we fit a line and then evaluate 
the predictive uncertainty in the BH mass. The variance of the predictive uncertainties 
associated with each of the 100 synthetic data sets gives us the error on the error. 

In Figure 3, the peak of each curve is the most likely value of the uncertainty in a new 
BH mass measurement. The width of each curve shows the uncertainty in this quantity 
determined as described above by Bootstrap Monte Carlo simulations. It turns out that this 
is dominated by uncertainty in e resulting from small data sets for all predictors. 

The conclusion from Figure 3 is that nearly all of the curves overlap significantly, and 
this turns out to be because the residual variance is not well constrained. Most of the curves 
are similar between Figures 2 and 3, indicating that the predictive uncertainty of most of 
the relations is dominated by residual variance. 

Two correlations, with C-& e and <7mf, move significantly between the two figures, meaning 
that the predictive uncertainty is dominated by observational error. As noted in §2.2, this 
is consistent with the possibility that the estimated error bars for these two quantities are 
too large. If the error bars are accurate, then more accurate measurements of either of these 
two quantities will offer excellent BH mass inferences. Additional data will be able to test 
whether or not this is the case. 

Ideally, a comparison of different predictors of BH mass would be done using the same 
galaxy sample for each of the different predictors. However, the uncertainty associated 
with small sample sizes dominates both the uncertainty in the residual variance and the 
uncertainty in the predictive error in BH mass for the various relations. The sizes of the 
data sets must be larger before it becomes profitable to remove galaxies that do not appear 
in all of the data sets. 



5. Conclusions 

We have constructed a framework with which to compare the correlations between BH 
mass and various global galactic properties. For each galaxy quantity, we estimate the 
preferred values and uncertainties in the slope, offset, and residual variance of the relation. 
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Estimating the error on the residual variance is particularly important, since handling it 
systematically makes clear that it is not well enough constrained to draw strong distinctions 
between the power of various proposed correlations. We assume that all of the relations are 
power laws. It may be possible to reduce the residual variance of a relation at the expense 
of additional model parameters. 

Of the BH mass predictor variables considered here, we find that <xmf and Cr 6 seem to 
exhibit the smallest residual variance, and thus the tightest intrinsic correlations with the 
BH mass, but the only 3 a statement that can be made is that Cn e is better than Mj, mnaer, 
and 7«fp. Any such claims about the residual variance also depend critically on the assumed 
measurement errors. 

Concerning the use of these correlations to infer BH mass, we find that again the existing 
data sets do not offer a clear favorite. Some correlations may be marginally better than 
others, but the only 3 a statement is that <tt, o"mf, and Mr^ are preferable to M-pv- There 
does not seem to be a profound difference arising from the different ways of measuring 
velocity dispersion. 

The only two quantities for which BH mass inferences are dominated by measurement 
error are C^ e and omf- If the error bars on these quantities are correct, more accurate data 
should allow excellent BH mass inferences. 

The murkiness of the present situation is due almost entirely to the limited number of 
galaxies for which reliable BH mass measurements are available. Consistently handling the 
uncertainty in the residual variance due to the finite sample size reveals that there are simply 
too few data to make strong statements. 

We caution that all of these conclusions depend critically on the accuracy of the pub- 
lished error estimates in all quantities under consideration. The common practice of adopting 
conservatively large error bars with the intention of ensuring that the true value lies within 
the quoted range is undesirable in this case, since it artificially suppresses the amount of 
residual variance required by the fit. 
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Table 1. BH mass predictor variables m F p and mb u i g e f° r galaxies from Gebhardt et al. 

(2003) 



Name Distance (Mpc) R e (kpc) R e ref. mpp (M ) B — V (mag) B — V ref. wibuige {M, 



N0821 
N2778 
N3377 


24.1 ±4.1 
22.9 ±6.9 

11.2 ± 1.0 


11.63 
3.50 
1.84 


2 
2 
1 


2Mt° f 8 x 10 11 
4.29^1 x 10 10 
1.55^ x 10 10 


0.93 
0.91 
0.84 


4 
4 
5 


o i 7+0.47 
z " 1 ' -0.39 

4.201^ 

9 i O+0.31 
z - io -0.27 


N3384 


11.6 ± 1.6 


2.80 


2 


2.29l°|° x 10 10 


0.91 


5 


i-9ol8:lg 


N3608 


22.9 ±3.2 


3.91 


1 


5.19l°f x 10 10 


0.98 


5 


6-671^1 


N4291 


26.2 ±8.4 


4.38 


2 


l-03±g-|| x 10 11 


0.93 


4 


7-66±I;?I 


N4473 
N4564 


15.7 ±2.0 
15.0 ±2.5 


3.98 
1.59 


2 
1 


5.75l° : jj? x 10 10 
l.Q7t ll x 10 10 


0.92 
0.98 


4 
5 


1 C9+0.33 


N4649 
N4697 


16.8 ±2.5 
11.7 ± 1.6 


6.04 
4.25 


1 
1 


3.58+^ x 10 11 
5.33lg-jj? x 10 10 


0.99 
0.95 


5 
5 


c oq+1.15 
d -°°-0.96 
1 17 +0.22 
J-- 1 ' -0.18 


N5845 


25.9 ±5.4 


1.02 


3 


2.23l°$ x 10 10 


0.97 


5 


3.44+^? 


N7457 


13.2 ±2.8 


4.13 


2 


7Al±{f 4 x 10 9 


0.83 


5 


6-8llr.Ii 



Note. — All distances are from Tonry et al. (2001). Half-light radius (R e ) sources are: (1) Faber et al. '. 
(2) de Vaucouleurs et al. 1995; (3) Faber et al. 1989. B — V color sources are: (4) Faber et al. 1997; (I 
Vaucouleurs et al. 1995. 
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Table 2. Fit Parameters for Residual Variance in the x-Coordinate 



Quantity 


a 


P 


e 


Zero point 


N 


cg 


o.ioo ± u.uyy 


4.15 ± U.DI 


U.UbO ± U.Uzo 


zUU km s 




0"t 


8.155 ± 0.062 


4.59 ± 0.34 


0.063 ± 0.012 


200 km s" 1 


31 


0"MF 


8.117 ± 0.066 


4.54 ± 0.38 


0.030 ± 0.020 


200 km s" 1 


27 


m G eb 


8.38 ± 0.16 


1.55 ± 0.33 


0.329 ± 0.088 


10 11 M 


12 


mFP 


8.65 ± 0.20 


1.92 ± 0.42 


0.343 ± 0.086 


10 11 M Q 


12 


7™Haer 


8.348 ± 0.096 


1.38 ± 0.15 


0.293 ± 0.065 


10 11 M 


30 


C'Re 


8.438 ± 0.078 


6.56 ± 0.57 


< 0.023 


0.5 


21 


-M B 


8.39 ± 0.14 


0.83 ± 0.15 


0.52 ± 0.16 


-20 


12 


-Mj 


8.190 ± 0.081 


0.524 ± 0.056 


0.68 ± 0.12 


-23 


27 


-M R>B 


8.456 ± 0.084 


0.554 ± 0.059 


0.55 ± 0.14 


-22 


20 


-M RM 


8.37 ± 0.10 


0.537 ± 0.072 


0.65 ± 0.16 


-22 


18 



Table 3. Fit Parameters for Residual Variance in the y- Coordinate 



Quantity 


a 




p 




e 


Zero point 


N 




8.143 ± 0.094 




3.69 ± 0.51 


0. 


.261 ± 0.091 


200 km s _1 


12 




8.133 ± 0.059 




4.10 ± 0.30 





,272 ± 0.050 


200 km s" 1 


31 


cmf 


8.116 ± 0.065 




4.42 ± 0.37 





433 ± 0.090 


200 km s" 1 


27 


^Geb 


8.24 ± 0.13 




1.05 ± 0.22 




0.42 ± 0.12 


10 11 M 


12 


777.FP 


8.34 ± 0.15 




1.07 ± 0.25 




0.49 ± 0.13 


10 11 M 


12 


^Haer 


8.291 ± 0.086 




1.13 ± 0.12 


0. 


.365 ± 0.080 


10 11 M 


30 


CRe 


8.438 ± 0.078 




6.56 ± 0.57 




< 0.15 


0.5 


21 


-M B 


8.28 ± 0.12 




0.63 ± 0.12 




0.37 ± 0.12 


-20 


12 


-Mj 


8.190 ± 0.074 


0. 


.426 ± 0.046 


0. 


.320 ± 0.058 


-23 


27 


-M R>B 


8.430 ± 0.080 


0. 


.488 ± 0.051 


0. 


.285 ± 0.075 


-22 


20 


-M RM 


8.325 ± 0.092 


0. 


.441 ± 0.058 





,314 ± 0.081 


-22 


18 
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Table 4. Correlation Coefficients with Confidence Regions. 



Quantity 


N 


Pearson r 


Confidence Region 


Spearman r 


Confidence Region 




12 


0.89 


[-0.62, 0.998] 


0.85 


[-0.72, 0.998] 


0"t 


31 


0.92 


[0.82, 0.96] 


0.92 


[0.82, 0.96] 


0"MF 


27 


0.92 


[0.81, 0.97] 


0.92 


[0.82, 0.97] 


™ G eb 


12 


0.78 


[-0.81, 0.997] 


0.63 


[-0.89, 0.99] 


mpp 


12 


0.71 


[-0.85, 0.995] 


0.59 


[-0.90, 0.99] 




30 


0.84 


[0.67, 0.92] 


0.84 


[0.68, 0.93] 


CRe 


21 


0.89 


[0.67, 0.97] 


0.93 


[0.76, 0.98] 


-M fl 


12 


0.81 


[-0.77, 0.997] 


0.60 


[-0.90, 0.99] 


-M, 


27 


0.88 


[0.73, 0.95] 


0.86 


[0.69, 0.94] 


-M^b 


20 


0.89 


[0.62, 0.97] 


0.88 


[0.61, 0.97] 




18 


0.86 


[0.46, 0.97] 


0.85 


[0.41, 0.97] 



Note. — Confidence regions are 68% regions computed assuming that tanh 1 r is 
distributed normally, as described in Cowan (1998, p. 129) 



